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Abstract 

Two-player games on graphs are central in many problems in formal verification and program analysis such as 
synthesis and verification of open systems. In this work, we consider both finite-state game graphs and recursive 
game graphs (or pushdown game graphs) that can model the control flow of sequential programs with recursion. The 
objectives we consider are multi-dimensional mean-payoff objectives, where the goal of player 1 is to ensure that the 
mean-payoff is at least zero in all dimensions. In pushdown games two types of strategies are relevant: (1) global 
strategies, that depend on the entire global history; and (2) modular strategies, that have only local memory and thus 
do not depend on the context of invocation, but only on the history of the current invocation of the module. We present 
solutions to several fundamental algorithmic questions and our main contributions are as follows: (1) We show that 
finite-state multi-dimensional mean-payoff games can be solved in polynomial time if the number of dimensions and 
the maximal absolute value of weights (or rewards) are fixed; whereas if the number of dimensions is arbitrary, then 
the problem is already known to be coNP-complete. (2) We show that pushdown graphs (or one-player pushdown 
games) with multi-dimensional mean-payoff objectives can be solved in polynomial time. (3) For pushdown games 
under global strategies both single and multi-dimensional mean-payoff objectives problems are known to be undecid- 
able, and we show that under modular strategies the multi-dimensional problem is also undecidable (whereas under 
modular strategies the single dimensional problem is known to be NP-complete). We show that if the number of 
modules, the number of exits, and the maximal absolute value of the weight are fixed, then pushdown games under 
modular strategies with single dimensional mean-payoff objectives can be solved in polynomial time, and if either 
the number of exits or the number of modules is unbounded, then the problem is NP-hard. (4) Finally we show that a 
fixed parameter tractable algorithm for finite-state multi-dimensional mean-payoff games or pushdown games under 
modular strategies with single-dimensional mean-payoff objectives would imply the solution of the long-standing 
open problem of fixed parameter tractability of parity games. 

Keywords: ( 1 ) Finite-state graph games; (2) Mean-payoff objectives; (3) Multi-dimensional objectives; (4) Pushdown 
graphs and games. (5) Computer-aided verification. 
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1 Introduction 



Mean-payoff games on graphs. Two-player games played on finite-state graphs provide the mathematical framework 
to analyze several important problems in computer science as well as mathematics, such as formal analysis of reactive 
systems [ 10 30 29|. Games played on graphs are dynamic games that proceed for an infinite number of rounds. The 
vertex set of the graph is partitioned into player- 1 vertices and player-2 vertices. The game starts at an initial vertex, 
and if the current vertex is a player- 1 vertex, then player 1 chooses an out-going edge, and if the current vertex is 
a player-2 vertex, then player 2 does likewise. This process is repeated forever, and gives rise to an outcome of the 
game, called a play, that consists of the infinite sequence of vertices that are visited. The most well-studied payoff 
criteria in such games is the mean-payoff objective, where a weight (representing a reward) is associated with every 
transition and the goal of one of the players is to maximize the long-run average of the weights (rewards) (and the 
goal of the opponent is to minimize). Mean-payoff games and the special case of graphs (with only one player) with 
mean-payoff objectives have been extensively studied over the last three decades (e.g. [26 , 16 , 35 22 |). Graphs with 
mean-payoff objectives can be solved in polynomial time [26 1, whereas mean-payoff games can be decided in NP n 
coNP lfl6l 1351 . The mean-payoff games problem is an intriguing and one of the rare combinatorial problems that is 
known to be in NP n coNP, but no polynomial time algorithm is known. However, pseudo-polynomial time algorithms 
exist for mean-payoff games [35 , 9 1, and if the weights are bounded by a constant, then the algorithm is polynomial. 

The extensions. Motivated by applications in formal analysis of reactive systems, the study of mean-payoff games has 
been extended in two directions: (1) pushdown mean-payoff games; and (2) multi-dimensional mean-payoff games on 
finite game graphs. Pushdown games (or games on recursive state machines) can model reactive systems with recur- 
sion (or model the control flow of sequential programs with recursion). Pushdown games have been studied widely 
with applications in verification, synthesis, and program analysis in ||34l [33l [3] |2) (also see IfTSl [T9l [7] [6] for sample 
research in stochastic pushdown games). In applications of verification and synthesis, the quantitative objectives that 
typically arise are multi-dimensional quantitative objectives (i.e., conjunction of several objectives), e.g., to express 
properties like the average response time between a grant and a request is below a given threshold v\, and the average 
number of unnecessary grants is below threshold V2 ■ Thus mean-payoff objectives can express properties related to re- 
source requirements, performance, and robustness; multiple objectives can express the different, potentially dependent 
or conflicting objectives. Thus pushdown games and graphs with mean-payoff objectives, and finite-state game graphs 
with multi-dimensional mean-payoff objectives are fundamental theoretical questions in quantitative analysis of re- 
active systems (along with recursion feature). Also pushdown games with multi-dimensional objectives is a natural 
generalization to study. Furthermore, in applications related to reactive system analysis, the number of dimensions of 
mean-payoff objectives is typically small (say 2 or 3) as they denote the different types of resources; and the weights 
are bounded by a constant (they denote the resource consumption amount); whereas the state space of the reactive 
system is huge (see [5 , 8 1 for examples). 

Relevant aspects of pushdown games. In pushdown games two types of strategies are relevant and studied in the 
literature. The first is the global strategies, where a global strategy can choose the successor vertex depending on the 
entire global history of the play (where history is the finite sequence of configurations of the current prefix of a play). 
The second is the modular strategies, and modular strategies are understood more intuitively in the model of games 
on recursive state machines. A recursive state machine (RSM) consists of a set of component machines (or modules). 
Each module has a set of nodes (atomic states) and boxes (each of which is mapped to a module), a well-defined 
interface consisting of entry and exit nodes, and edges connecting nodes/boxes. An edge entering a box models the 
invocation of the module associated with the box and an edge leaving the box represents return from the module. In 
the game version the nodes are partitioned into player- 1 nodes and player-2 nodes. Due to recursion the underlying 
global state-space is infinite and isomorphic to pushdown games. The equivalence of pushdown games and recursive 
games has been established in |3j. A modular strategy is a strategy that has only local memory, and thus, the strategy 
does not depend on the context of invocation of the module, but only on the history within the current invocation of 
the module. Informally, modular strategies are appealing because they are stackless strategies, decomposable into one 
for each module. 

Previous results and open questions. We now summarize the main previous results and open questions and then 
present our contributions. 

1. (Finite-state graphs). Finite-state graphs (or one-player games) with mean-payoff objectives can be solved in 
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polynomial time l26l . and finite-state graphs with multi-dimensional mean-payoff objectives can also be solved 
in polynomial time |32| using the techniques to detect zero-circuits in graphs of l27ll . 

2. (Finite-state games). Finite-state games with single mean-payoff objectives can be decided in NP n coNP |[35l 
[161 , and pseudo-polynomial time algorithms exist for mean-payoff games [35, 9] (the current fastest known 
algorithm works in time 0(n ■ m ■ W), where n is the number of vertices, m is the number of edges, and 
W is the maximal absolute value of the weights [9]). Finite-state games with multi-dimensional mean-payoff 
objectives are coNP-complete with weights in { — 1, 0, 1} (i.e., the weights are bounded by a constant) but with 
arbitrary dimensions IfTTII . and the current best known algorithm works in time 0(2™ • poly(n, m, log W)). 

3. (Pushdown graphs and games). Pushdown graphs and games have been studied only for single mean-payoff 
objectives lfl2l . Under global strategies, pushdown graphs with single mean-payoff objectives can be solved 
in polynomial time, whereas pushdown games are undecidable. Under modular strategies, pushdown graphs 
with every module restricted to have single exit and weights restricted to { — 1, 0, 1} are NP-hard, and pushdown 
games (with any number of exits and general weight function) can be solved in NP (i.e., the problems are 
NP-complete) lfT2l . 

Many fundamental algorithmic questions have remained open for analysis of finite-state and pushdown graphs and 
games with multi-dimensional mean-payoff objectives (where the goal of player 1 is to ensure that the mean-payoff is 
at least zero in all dimensions), such as, (A) Can finite-state game graphs with multi-dimensional mean-payoff objec- 
tives with 2 or 3 dimensions and constant weights be solved in polynomial time? (note that with arbitrary dimensions 
the problem is coNP-complete, and for arbitrary weights no polynomial time algorithm is known even for the single 
dimension case); (B) Can pushdown graphs under global strategies with multi-dimensional mean-payoff objectives 
be solved in polynomial time?; (C) Can polynomial time algorithm be obtained for pushdown games under modular 
strategies with single mean-payoff objectives when relevant parameters (such as the number of modules) are bounded?; 
and (D) In what complexity class does pushdown games under modular strategies with multi-dimensional mean-payoff 
objectives lie? The above questions are not only of theoretical interest, but stems from practically motivated problems 
of formal analysis of reactive systems. 

Our contributions. In this work we provide answers to many of the open fundamental questions. Our contributions 
are summarized as follows: 

1. (Hyper-plane technique). We use the separating hyper-plane technique from computational geometry to an- 
swer the open questions (A) and (B) above. First, we present an algorithm for finite-state games with multi- 
dimensional mean-payoff objectives of ^-dimensions that works in time 0(n ■ m 2 ■ fc • (jfe • n ■ W) k + 2 - k+1 ) 
(Section 2: Theorem [2]), and thus for constant weights and any constant k (not only k = 2 or k = 3) our 
algorithm is polynomial. Second, we present a polynomial-time algorithm for pushdown graphs under global 
strategies with multi-dimensional mean-payoff objectives (Section 3: Theorem[4| (the algorithm is polynomial 
for general weight function and any number of dimensions). Our key intuition is to reduce the multi-dimensional 
problem to searching for a separating hyper-plane such that all realizable mean-payoff value vectors lie on one 
side of the hyper-plane. This intuition allows us to search for a vector (normal to the separating hyper-plane) and 
reduce the multi-dimensional problem to a single dimensional problem by multiplying the multi-dimensional 
weight function by the vector. 

2. (Modular pushdown games). We first show that the hyper-plane techniques do not extend for modular strategies 
in pushdown games: we show that pushdown games under modular strategies with multi-dimensional mean- 
payoff objectives with fixed number of dimensions are undecidable (Section 4: Theorem [5]). Thus the only 
relevant algorithmic problem for pushdown games is the modular strategies problem for single mean-payoff 
objectives (under global strategies even single mean-payoff objectives problem is undecidable lfl2l ). It was 
already shown in lfl2ll that if the number of modules is unbounded, then even with single exits for every module 
the problem is NP-hard. We show that pushdown games under modular strategies with single mean-payoff 
objectives are NP-hard with two modules with weights {— 1, 0, 1} if the number of exits is unbounded (Section 4: 
Theorem|6}. Thus for polynomial time algorithm we need to bound both the number of modules as well as the 
number of exits. We show that pushdown games under modular strategies with single mean-payoff objectives 
can be solved in time (n ■ M)°( M + M ' E ) . W°( M w ijere n is the number of vertices, W is the maximal 
absolute weight, M is the number of modules, and E is the number of exits (Section 4: Theorem|H}. Thus if M, 
E, and W are constants, our algorithm is polynomial. Hence we answer the open questions (C) and (D). 
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3. (Hardness for fixed parameter tractability). Given our polynomial time algorithms when the parameters are 
fixed for finite-state multi-dimensional mean-payoff games and pushdown games with single mean-payoff ob- 
jectives under modular strategies, a natural question is whether they are fixed parameter tractable, e.g., could we 
obtain an algorithm that runs in time f(k) ■ 0(poly(n, m, W)) (resp. f(M, E) ■ 0(poly (n, W))) for finite-state 
multi-dimensional mean-payoff games (resp. for pushdown modular games with single objective), for some 
computable function / (e.g., exponential or double exponential). We show the hardness of fixed parameter 
tractability problem by reducing the long-standing open problem of fixed parameter tractability of parity games 
to both the problems (Section 2: Theorem[3]and Section 4: Theorem[9]), i.e., fixed parameter tractability of any 
of the above problems would imply fixed parameter tractability of parity games. 

2 Finite-State Games with Multi-dimensional Mean-Payoff Objectives 

In this section we will present two results: (1) an algorithm for finite-state multi-dimensional mean-payoff games 
and when the number of dimensions and weights are fixed the running time will be polynomial; and (2) a reduction 
of finite-state parity games to finite-state multi-dimensional mean-payoff games with polynomial weights that shows 
that fixed parameter tractability of multi-dimensional mean-payoff games would imply the solution of long-standing 
open problem of fixed parameter tractability of parity games. We start with the basic definitions of finite-state games, 
strategies, and mean-payoff objectives. 

Game graphs. A game graph G — ((V,E), (Vi, V2)) consists of a finite directed graph (V, E) with a finite set V 
of n vertices and a set E of m edges, and a partition (Vi, V2) of V into two sets. The vertices in V\ are player 1 
vertices, where player 1 chooses the outgoing edges, and the vertices in V2 are player-2 vertices, where player 2 (the 
adversary to player 1) chooses the outgoing edges. Intuitively game graphs are the same as AND-OR graphs. For a 
vertex u E V, we write Out(u) = {v E V \ (u, v) E E} for the set of successor vertices of u. We assume that every 
vertex has at least one out-going edge, i.e., Out(-u) is non-empty for all vertices u E V. 

Plays. A game is played by two players: player 1 and player 2, who form an infinite path in the game graph by moving 
a token along edges. They start by placing the token on an initial vertex, and then they take moves indefinitely in the 
following way. If the token is on a vertex in V\, then player 1 moves the token along one of the edges going out of the 
vertex. If the token is on a vertex in Vi, then player 2 does likewise. The result is an infinite path in the game graph, 
called plays. Formally, a play is an infinite sequence 7r = (vq, «i, i>2, • • .) of vertices such that (vk, Vk+i) £ E for all 
k > 0. We write II for the set of all plays. 

Strategies. A strategy for a player is a rule that specifies how to extend plays. Formally, a strategy r for player 1 is 
a function r: V* ■ Vi — >■ V that, given a finite sequence of vertices (representing the history of the play so far) which 
ends in a player 1 vertex, chooses the next vertex. The strategy must choose only available successors, i.e., for all 
w E V* and v G V\ we have t[w ■ v) E Out(u). The strategies for player 2 are defined analogously. A strategy 
is memoryless if it is independent of the history and only depends on the current vertex. Formally, a memoryless 
strategy for player 1 is a function r: V\ V such that t(v) E Out(w) for all v E V\, and analogously for player 2 
strategies. Given a starting vertex v E V, a strategy r for player 1, and a strategy a for player 2, there is a unique 
play, denoted tt(v, t, a) = (vq, V\, V2, ■ ■ ■}, which is defined as follows: Vq = v and for all k > 0, if Vk E V\, then 
T((vo,Vi,...v k )) = v k +i, andif v k E V 2 , then a((v , v\, v k )) =v k +i. 

Graphs obtained under memoryless strategies. A player- 1 graph is a special case of a game graph where all 
vertices in V2 have a unique successor (and player-2 graphs are defined analogously). Given a memoryless strategy a 
for player 2, we denote by G a the player- 1 graph obtained by removing from all player-2 vertices the edges that are 
not chosen by a. 

Multi-dimensional mean-payoff objectives. For multi-dimensional mean-payoff objectives we will consider game 
graphs along with a weight function w : E — > Z k that maps each edge to a vector of integer weights. For a finite path ir, 
we denote by 10(71") the sum of the weight vectors of the edges in ir andAvg(Tr) = where | 7r| is the length of it, de- 
note the average vector of the weights. We denote by Avg^ (ir) the projection of Avg(7r) to the z-th dimension. For an in- 
finite path 7T, let pi denote the finite prefix of length i of tt; and we define LimlnfAvg, (ir) = liminfn^oo Avg i (p n ) and 
analogously LimSupAvg 1 (n) with lim inf replaced by limsup. For an infinite path tt, we denote by LimlnfAvg(7r) = 
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(LimlnfAvg 1 (7r), . . . , LimlnfAvg fe (7r)) (resp. LimSupAvg(7r) = (LimSupAvg 1 (7r), . . . , LimSupAvg fe (7r))) the limit- 
inf (resp. limit-sup) vector of the averages (long-run average or mean-payoff objectives). In this work, the objective of 
player 1 would be to ensure that the mean-payoff is non-negative in every dimension, i.e., to ensure LimlnfAvg(7r) > 0, 
where denote the vector of all zeros. 

Remark 1. A mean-payoff objective is invariant to shift operation, i.e., if in a dimension i, we require that the mean- 
payoff is at least Vi, then we simply subtract i>j in the weight vector from every edge in the i-th dimension and require 
the mean-payoff is at least in dimension i. Hence the comparison with is without loss of generality. We will present 
all the results for LimlnfAvg objectives and the results for LimSupAvg objectives are simpler. Hence for brevity in 
sequel we will write Li mAvg for LimlnfAvg. Moreover, all the results we will present would also hold if we replace 
the non-strict inequality comparison (> 0) with a strict inequality (> 0). 

Winning strategies. A player- 1 strategy r is a winning strategy from a vertex v if for all player-2 strategies a we have 
LimAvg(7r(i>, r, cr)) > 0. A player-2 strategy is a winning strategy from a vertex v if for all player- 1 strategies t we 
have that the path 7r(u, t, a) does not satisfy LimAvg(7r(v, r, cr)) > 0. 

2.1 Hyper-plane based algorithm 

In this subsection we will present our algorithm to decide the existence of a winning strategy for player 1 in finite-state 
multi-dimensional mean-payoff games. We start with some basic known results. 

Theorem 1 ( H32I ). The following assertions hold: 

1. For finite-state multi-dimensional mean-payoff games, if player 2 has a winning strategy from a vertex v, then 
player 2 has a memoryless winning strategy. 

2. In a player-1 graph G, there is a path it from a vertex v with LimAvg(7r) > if and only if there is a strongly 
connected component (SCC) S reachable from v that contains simple cycles C\, . . . ,Cg and I positive reals 
X\, . . . , xg such that 2~2i=i x i ' w {Ci) > 0, where w(Ci) denotes the sum of edges weights of simple cycle Ci. 

Lemma 1 (Farkas' Lemma). Let v\, . . . , vi be vectors in {±0, ±1, . . . , ±a} k , and let M = (k ■ a) k+1 , then exactly 
one of the following two conditions hold: 

• there exists £ reals ax, . . . , ag, > that are not all zeros, such that Yl%=\ a i ' &i = 0>' 

• there exists a vector A £ {±0, ±1, . . . , ±M} k , such that for every i £ {1, ...,£} we have X T ■ vg < 0, where 
X T is the transpose of X. 

Proof. For the proof of this version of Farkas' Lemma, see l|28l Lemma 21. □ 

Hyper-plane based technique. For the rest of this section we fix M — (k ■ n ■ W) +1 , where W is the maximal 
absoulte value of the weight function. Given a multi-dimensional weight function w and a vector A we denote by 
w ■ X the single dimensional weight function that assigns every edge e the weight value w(e) T ■ A, where w(e) T is 
the transpose of the weight vector w(e). The key intuition derived from Farkas' lemma (Lemma[TJ allows to reduce 
the multi-dimensional problem to searching for a separating hyper-plane such that all realizable mean-payoff value 
vectors lie on one side of the hyper-plane. This intuition allows us to search for a vector A (normal to the hyper-plane) 
and reduce the multi-dimensional problem to a single dimensional problem by multiplying the weight vector with A. 
We now prove two lemmas to formalize the intuition. In the lemmas we require two assumptions, which we later show 
how to deal with. 

Lemma 2. Consider a game graph G = ((V, E), (Vi, V2)) with a weight function w such that (1) E D (V2 x V) C 
E n (V2 x Vi) (i.e., all out-going edges of player-2 vertices is to a player-1 vertex); and (2) every player-1 vertex has 
k self-loop edges e\, . . . , such that Wi(e.j) —0ifi^= j and Wj(ej) = — 1. If player 2 has a winning strategy in the 
multi-dimensional mean-payoff game from a vertex vertex vq, then there exist a vertexvi and X £ {±0, ±1, . . . , ±M} fc 
such that player 2 has a winning strategy in the one-dimensional mean-payoff game over G with weight function w ■ X 
from the initial vertex V\ (note that v\ can be different from vq and the existence of Vi shows that the set of winning 
vertices for player 2 is non-empty). 
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Proof. By TheoremQlitem 1) if there is a winning strategy for player 2 in the multi-dimensional mean-payoff game, 
then there exists a memoryless winning strategy a. We fix the strategy a and consider the player- 1 graph G a . Let S be 
a terminal strongly connected component (or a bottom SCC) in G a that is reachable from vq\ and let C\ , . . . , Cg be the 
simple cycles that occur in S. Since a is a player-2 winning strategy for the multi-dimensional mean-payoff objective, 
it follows by Theorem Q2item 2) that the inequalities Yli=i x i ' w (Ci) > and Xi > do not have a non-trivial 
solution (all Xi = is a trivial solution). By assumption 1 every SCC has at least one player- 1 vertex, and then by 
assumption 2 the SCC S has cycles with weights ( — 1, 0, . . . , 0), (0, — 1, . . . , 0), . . . , (0, 0, . . . , —1), and it follows that 
the equalities 2~2i=i x i ' w (Ci) = subject to Xi > also do not have a non-trivial solution. Since in every simple 
cycle Ci, for all dimensions 1 < j < k we have Wj(Ci) is bounded by n ■ W, then by Lemma[T]there exists a vector 
A G {±0, ±1, . . . , ±M } k such that w(Ci) T • A < for every simple cycle Ci in the SCC S. Hence a is a player-2 
winning strategy for the one-dimensional mean-payoff objective with weight function w ■ A, for all games with initial 
vertices in S. □ 

Lemma 3. Consider a game graph G — {{V 1 E), (Vi, V2)) with a weight function w such that (1) E n (V 2 xV) C 
E n (V 2 x V\); and (2) every player-1 vertex has k self-loop edges e\, . . . , e& such that Wi{ef) = if i ^ j and 
Wi(ei) = — 1. If there exist Vq G V and A G {±0, ±1, . . . , ±M} k such that player 2 has a winning strategy from 
initial vertex vo in the single dimensional mean-payoff game with weight function w ■ A, then player 2 has a winning 
strategy from vq in the multi-dimensional mean-payoff game with weight function w. 

Proof. If player 2 has a winning strategy in the single dimensional mean-payoff game with weight function w ■ A, then 
by TheoremUlitem 1) there exists a memoryless winning strategy a (this result also follows from [ 16]). Observe that 
since a is a winning strategy it follows that for all cycles C in G a that are reachable from vq have negative weight 
under weight function w ■ A, i.e., (w ■ A)(C) < 0. 

We claim that the memoryless strategy a is also a winning strategy from vo in the multi-dimensional mean- 
payoff game. Let S be a strongly connected component in G a reachable from vq and let Ci, . . . , Ci be the simple 
cycles that occur in S. Since w(Ci) • A < for every simple cycle Ci, and since for all 1 < j < k we have 
Wj(Ci) are bounded by n ■ W, then by Lemma[T]we get that the equalities 53i=i x i ' w {Ci) = subject to Xi > 
do not have a non-trivial solution. By the two assumptions (as in the previous lemma) S has cycles with weights 
(— 1, 0, . . . , 0), (0, — 1, . . . , 0), . . . , (0, 0, . . . , — 1), and it follows that the equalities 53<=i x i ' w (^i) ^ subject to 
Xi > do not have a non-trivial solution as well. Hence, by TheoremUlitem 2) it follows that a is a winning strategy 
for player 2 in the multi-dimensional mean-payoff game from vq. □ 

We now present the consequence of the previous two lemmas and mention how to remove the assumptions of the 
lemmas. Given any game graph G there exists a linear transformation to satisfy assumption (1) by simply adding a 
dummy vertex for every out-going edge of a player 2 vertex (i.e., for every edge e = (u, v) with u, v G V%, we add 
a vertex e, edges (u, e) with weight 10(e) and (e, v) with weight 0, and e is a player-1 vertex with a single out-going 
edge). Given a game graph G, let G be the game graph obtained from G after the above reduction that satisfies 
assumption (1), and refer to the transformation as assumption- 1 -transformation. Second we observe that adding the 
self-loop edges of assumption (2) (of Lemma|2]i do not affect winning for player 1, as if there is a winning strategy for 
player 1, then there is one that never chooses the self-loop edges of assumption (2). 

Corollary 1. Consider a game graph G = {{V, E), (Vi , V2)) with a weight function w, and let G be the game graph 
obtained after assumption-1 -transformation. Then player 1 has a winning strategy from all starting vertices in the 
multi-dimensional mean-payoff game G iff player 1 has a winning strategy from all starting vertices in the single 
dimensional mean-payoff game G with weight function w ■ Xfor all A G {±0, ±1, . . . , ±M} fc . 

To use the result of Corollary [T]iteratively to solve finite-state games with multi-dimensional mean-payoff objec- 
tives, we need the notion of attractors. For a set U of vertices, Attr-i (U) is defined inductively as follows: Uo — U and 
foralK > Owe have U l+1 = UiU{v G V x | Out(v) C Ui}l){v G V 2 | Out{v)C\U l ^ 0}, and Attr 2 (U) = U J>0 u ^■ 
Intuitively, from £/j+i player 2 can ensure to reach C/j in one step against all strategies of player 1, and thus Attr 2 (U) 
is the set of vertices such that player 2 can ensure to reach U against all strategies of player 1 in finitely many steps. 
The set Attr 2 (U) can be computed in linear time l23l l4l. Observe that if G is a game graph, then for all U, the game 
graph induced by the set V \ Attr 2 (U) is also a game graph (i.e., all vertices in V \ Attr 2 (U) have out-going edges 
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in V \ Attr2(U)). The following lemma shows that in multi-dimensional mean-payoff games, if U is a set of vertices 
such that player 2 has a winning strategy from every vertex in U, then player 2 has a winning strategy from all vertices 
in Attr2(U), and we can recurse in the game graph after removal of Attr2{U). 

Lemma 4. Consider a multi- dimensional mean-payoff game G with weight function w. Let U be a set of vertices 
such that from all vertices in U there is a winning strategy for player 2. Then the following assertions hold: (1) From 
all vertices in Attr2{U) there is a winning strategy for player 2. (2) Let Z be the set of vertices in the game graph 
induced after removal of Attr2(U) such that from all vertices in Z player 2 has a winning strategy in the remaining 
game graph. Then from all vertices in Z, player 2 has a winning strategy in the original game graph. 

Proof. The proof of the first item is as follows: from vertices in Attr2{U) first consider a strategy to ensure to reach 
U (within finitely many steps), and once U is reached switch to a winning strategy from vertices in U. The proof of 
second item is as follows: fix a winning strategy in the remaining game graph for vertices in Z and a winning strategy 
from Attr2 (U) for player 2. Consider any counter strategy for player 1 . If Attr2 (U) is ever reached, then the winning 
strategy from Attr2(U) ensures winning for player 2, and otherwise the winning strategy of the remaining game graph 
ensures winning. □ 

Algorithm. We now present our iterative algorithm that is based on Corollary [TJ and Lemma E] We first apply 
the assumption- 1 -transformation and then apply the algorithm. In the current iteration i of the game graph execute 
the following step: sequentially iterate over vectors A £ {±0, ±1, . . . , ±(k • n • W) k+1 } k ; and if for some A we 
obtain non-empty set U of winning vertices for player 2 for the one dimensional objective w ■ A in the current game 
graph, remove Attr2{U) from the current game graph and proceed to iteration i + 1. Otherwise if for all A £ 
{±0, ±1, . . . , ±(fc • n ■ W) k+1 } k , player 1 wins from all vertices for the single dimensional objective w ■ A, then the set 
of current vertices is the set of winning vertices for player 1 . The correctness of the algorithm follows from CorollaryQ] 
and Lemma|4] 

Complexity. Note that given a game graph with n vertices and m edges, the assumption- 1 -transformation constructs 
a game graph of m vertices and m edges. The algorithm has at most n iterations, and each iteration solves at most 
O(M) single dimensional mean-payoff games. Thus the iterative algorithm requires to solve 0(n ■ (k -n - W) k '( k+1 ^) 
single dimensional mean-payoff games with m edges, m vertices and the maximal weight is at most k ■ (k -n ■ W) k+1 . 
Since single dimensional mean-payoff games with n vertices, m edges, and maximal weight W can be solved in time 
0(n ■ m ■ W) |0, we obtain the following result. 

Theorem 2. The set of winning vertices for player 1 in a multi-dimensional mean-payoff game with n vertices, m 
edges, k-dimensions, and maximal absolute weight W can be computed in time 0(n ■ m ■ (k-n- W) fe -(fc+i) .k-(k- 
n ■ W) k+1 ) = 0(n-m 2 ■ k ■ (k ■ n ■ W) k2+2 - k+1 ). 

2.2 Hardness for fixed parameter tractability 

In this subsection we will reduce finite-state parity games to finite-state multi-dimensional mean-payoff games with 
weights bounded linearly by the number of vertices. Note that our reduction is different from reduction of parity games 
to single dimensional mean-payoff games where exponential weights are necessary [24] . We start with the definition 
of parity games. 

Parity games. A parity game consists of a finite-state game graph G along with a priority function p : E — y {1, . . . , k} 
that maps every edge to a natural number (the priority). The objective of player 1 is to assure that the minimal priority 
that occur infinitely often in a play is even, and the goal of player 2 is the complement. The memoryless determinacy of 
parity games show that for both players if there is a winning strategy, then there is a memoryless winning strategy ifTTl . 

The reduction. Given a game graph G with priority function p we construct a multi-dimensional mean-payoff 
objective with weight function w of k dimensions on G as follows: for every i £ {1, . . . , k} we assign ^i(e) as 
follows: 

• if p(e) > i; 
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• —1 if p(e) < i andp(e) is odd; and 

• n + 1 if p(e) < i and p(e) is even. 

Lemma 5. From a vertex v, if player 1 wins the parity game, then she also wins the multi-dimensional mean-payoff 
game. 

Proof. If player 1 is the winner in the parity game, then by memoryless determinacy of parity games there is mem- 
oryless winning strategy r. We show that r is also a winning strategy for the multi-dimensional mean-payoff game. 
Assume towards contradiction that player 2 is the winner in the mean-payoff game. Then by TheoremQlitem 1) there 
is a memoryless winning strategy a for player 2. The graph that is formed by r and a is a path that leads to a simple 
cycle C (as both r and a are memoryless). Since a is a winning strategy for player 2 in the multi-dimensional mean- 
payoff game, it follows that for some dimension i we have Wi(C) < 0. Let r be the minimal priority that occur in 
C; by the construction of w it must be that r < i, and in addition, r is odd (otherwise we have Wi(C) > 1). But this 
contradicts the assumption that r is a winning strategy for player 1 in the parity game, and the result follows. □ 

Lemma 6. From a vertex v, if player 2 wins the parity game, then she also wins the multi-dimensional mean-payoff 
game. 

Proof. If player 2 is the winner in the parity game, then by memoryless determinacy she has a memoryless winning 
strategy a. We claim that a is a winning strategy for player 2 in the multi-dimensional mean-payoff game. Consider 
the player- 1 graph G a . Towards contradiction we assume that in G a there is an infinite path 7r with non-negative 
mean-payoff value in every dimension (i.e., LimAvg(7r) > 0). By Theorem [Tfitem 2), it follows that in G a there is 
a SCC reachable from v that contains simple cycles C\, C2, . . . , Ci, such that for some constants x\, . . . , xi (not all 
zeros) we have 2~2j=i x j ' w (Cj) ^ 0- Note that by the construction of w, there are no simple cycles with weight 
exactly 0. Let i* be the minimal dimension such that for some j we have Wi* (Cj) < 0. We consider two distinct 
cases: 

• Case 1: There exists no such i*. In that case, let r be the minimal priority in the cycle C\. Since w r (C\) is 
non-negative, we get that r is even, and a contradiction to the assumption that a is a player-2 winning for the 
parity game. 

• Case 2: Such i* exists. Since Ylj=i x j ' w (Cj) > 0, it follows that for some simple cycle Cj we have (Cj) > 
0, and since i* is minimal, then for every i < i* we have Wi(Cj) > 0. Let r be the minimal priority that occur 
in (7,; as Wi* (Cj) > 0, we get that r < i*. Hence, w r (Cf) > and r is even, and we get a contradiction to the 
assumption that a is a player-2 winning strategy for the parity game. 

To conclude, such set of cycles do not exist, and thus a is a player-2 winning strategy also for the multi-dimensional 
mean payoff game. □ 

Theorem 3. Given a game graph G with a parity objective defined by a priority function of k-priorities we can 
construct in linear time a k-dimensional weight function w with maximal weight W bounded by n + 1 such that 
a vertex is winning for player 1 in the parity game iff the vertex is winning for player 1 in the multi-dimensional 
mean-payoff game. 

Remark 2. There exists a deterministic sub-exponential time algorithm for parity games [25 ] and also algorithms that 
run in time 0(n k / 3 ■ m) H31V : however obtaining a fixed parameter tractable algorithm for parity games that runs in 
time 0(f(k) -poly(n, m))for any function f ( exponential or double exponential) is a long-standing open problem. Our 
reduction (Theorem\3) shows that obtaining a fixed parameter tractable algorithm for multi-dimensional mean-payoff 
games that runs in time 0(f(k) -poly(n, m, W)) is not possible without first solving the fixed parameter tractability of 
parity games. We also point out that the hardness result does not hold for multi- dimensional LimSupAvg-oft/'ecf/vei, 
as if the weights are fixed, the problem can be solved in polynomial time [32]. 
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3 Pushdown Graphs with Multi-dimensional Mean-payoff Objectives 



In this section we consider pushdown graphs (or pushdown systems) with multi-dimensional mean-payoff objectives. 
We start with the basic notion of stack alphabet and commands. 

Stack alphabet and commands. Let T denote a finite set of stack alphabet, and Com(r) = {skip, pop} U {push(z) \ 
z e T} denote the set of stack commands over T. Intuitively, the command skip does nothing, pop deletes the top 
element of the stack, push(z) puts z on the top of the stack. For a stack command com and a stack string a € T + 
we denote by com (a) the stack string obtained by executing the command com on a (in a stack string by the top we 
denote the right end of the string). 

Multi-weighted pushdown systems. A multi-weighted pushdown system (WPS) (or a multi-weighted pushdown 
graph) is a tuple: 

A = (Q,T,q € Q,E C (Q x T) x (Q x Com(r)),w : E^Z k ), 

where Q is a finite set of states with qo as the initial state; T the finite stack alphabet and we assume there is a special 
initial stack symbol _L e T; E describes the set of edges or transitions of the pushdown system; and w is a weight 
function that assigns integer vector weights to every edge; we denote by Wi the projection of w to the i-th dimension. 
We assume that _L can be neither put nor removed from the stack. A configuration of a WPS is a pair (a, q) where 
a e T+ is a stack string and q E Q. For a stack string a we denote by Top(a) the top symbol of the stack. The initial 
configuration of the WPS is (_L, qo). We use W to denote the maximal absolute weight of the edge weights. 

Successor configurations and runs. Given a WPS A, a configuration c i+1 = (a i+ i , Qi+i) is a successor configuration 
of a configuration c, = (o^,^), if there is an edge (<?i,7i, (fc+i, com) e E such that com(cti) = ctj+i, where 
7, = Top(cti). A path wis a sequence of configurations. A path tt = (ci, . . . , c„ + i) is a valid path if for all 1 < i < n 
the configuration Cj+i is a successor configuration of Cj (and the notation is similar for infinite paths). In the sequel 
we shall refer only to valid paths. Let tt — (a, c 2 , . . . , Ci, c i+ i, . . .} be a path. We denote by n[j] = Cj the j-th 
configuration of the path and by ir[ii, i?\ — (c^ , c^+i, . . . , Cj 2 ) the segment of the path from the ii-th to the i2-th 
configuration. A path can equivalently be defined as a sequence {cie\e2 ■ ■ ■ e n ), where c\ is the initial configuration 
and ei are valid transitions. Our goal is to obtain an algorithm that given a WPS A decides if there exists an infinite 
path tt in A from q such that LimAvg(7r) > 0. 

Notations. We shall use (i) 7 or 73 for an element of T; (ii) e or e 4 for a transition (equivalently an edge) from E; (iii) a 
or cti for a string from T*. For a path tt = (ci, c 2 , . . .} = {c\e\e2 . . .} we denote by (i) o> the state of configuration 
Ci, and (ii) ac the stack string of configuration c,. 

Stack height and additional stack height of paths. For a path tt = ((cti, qi), ■ ■ ■ , (a n , q n )), the stack height of tt 
is the maximal height of the stack in the path, i.e., SH(-7r) = max-dail, . . . , \a n \}. The additional stack height 
of tt is the additional height of the stack in the segment of the path, i.e., the additional stack height ASH (71-) is 
SH(7r) - max{|ai|, |a„|}. 

Pumpable pair of paths. Let tt — {c\e\e2 . . .) be a finite or infinite path. A pumpable pair of paths for tt is a pair 
of non-empty sequence of edges: (^1,^2) = (e H e il+1 . . . e il+ni , e i2 e i2+ i . . . e i2+ri2 ), for m, n 2 >Q,i\> and 
«2 > h + ni such that for every j > the path tt^ p ^ obtained by pumping the pair p\ and p 2 of paths j times each 
is a valid path, i.e., for every j > we have 

7r (p 1 .p 2 ) = ( c l e l e 2 ■ • • l(6iiCii+l ■ • • e ii+i n Y e ii+i n +l ■ ■ ■ e i2 - 1 ( e «2 e i2 + l ■ ■ ■ e «2+«2 Y e 'h +"2 ■ ■ •) 

is a valid path. We will show that large additional stack height implies the existence of pumpable pair of paths. To 
prove the results we need the notion of local minima of paths. 

Local minima of a path. Let tt — (ci, C2, . . .) be a path. A configuration a = (an, qi) is a local minimum if for 
every j > i we have on C ctj (i.e., the stack string a, is a prefix string of ay). One basic fact about local minima 
of a path is as follows: Every infinite path has infinitely many local minima. We discuss the proof of the basic fact 
and some properties of local minima. Consider a path tt = (ci, c 2 , . . .}. If there is a finite integer j such that from 
some point on (say after i-th index) the stack height is always at least j, and the stack height is j infinitely often, 
then every configuration after i-th index with stack height j is a local minimum (and there are infinitely many of 
them). Otherwise, for every integer j, there exists an index i, such that for every index after i the stack height exceeds 
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j, and then for every j, the last configuration with stack height j is a local minimum and we have infinitely many 
local minima. This shows the basic fact about infinitely many local minima of a path. We now discuss a property of 
consecutive local minima in a path. If we consider a path and the sequence of local minima, and let Cj and Cj be two 
consecutive local minima. Then either Cj and Cj have the same stack height, or else Cj is obtained from c, with one 
push operation. 

Non-decreasing paths and cycles, and proper cycles. A path from configuration (cry, 91) to configuration (0,^0.2, 92) 
is a non-decreasing a-path if (cry, 91) is a local minimum. Note that if tt is a non-decreasing a-path for some a G V*, 
then it is a non-decreasing /3-path for every (3 G T* . Hence we say that 7r is a non-decreasing path if there exists a G T* 
such that 7r is a non-decreasing a-path. A non-decreasing cycle is a non-decreasing path from (ax, q) to (a2, 9) such 
that the top symbols of ct\ and oli are the same. A non-decreasing cycle from (ax, 9) to («2, 9) is a proper cycle if 
ai = ct2 (i.e., returns to the same configuration). By convention, when we say that a path 7r is a non-decreasing path 
from (71, qi) to (72, 92), it means that for some ax, 02 G T*, the path 7r is a non-decreasing path from (ax 71, 9x) to 
(ai7ia 2 72,<?2)- 

Cone of pumpable pairs. We denote R + = [0, +00). For a finite non-decreasing path tt we denote by 
PPS(7r) the (finite) set of pumpable pairs that occur in tt, that is, PPS(7r) = {(pi,P2) G (E* x E*) 
pi and P2 are a pumpable pair in tt}. Let PPS(7r) = {Pi = (p\,p\),P2 = (PxjPI)) • • • > Pj = (PxiPi)}' and we 
denote by PumpMat(Tr) the matrix that is formed by the weight vectors of the pumpable pairs of tt, that is, the matrix 
has j rows and the i-th row of the matrix is w(p\) + w(p 2 ) (every weight vector is a row in the matrix). We denote by 
PCone(7r) the cone of the weight vectors in PPS(7r), formally, PCone(7r) = {PumpMat(7r) • x | x G (R+\{0)}}. 

Fix £ = (|Q| • |r|)d ( 3l'l r l) 2+1 for the rest of the section. For 91,92 G Q and 71,72 G T, by abuse of 
notation we denote by PPS((7x, q%), (72, 92)) the (finite) set of all pumpable pair of paths, not longer than £, 
that occur in a non-decreasing path from (71,91) to (72,92); we similarly define PumpMat((7i, qi), (72,92)) and 
PCone((7i,9i), (72,92))- If 9i = 92 and 71 = 72, then we abbreviate PPS((7i,9i), (71,91)) by PPS((7x, ?x)), and 
similarly PumpMat and PCone. The next lemma was proved in [ 12|. 

Lemma 7 ( B12I ). Let tt be a finite path such that ASH(7r) = d > (\Q\ ■ \T\) 2 . Then tt has a pumpable pair of paths. 

In the next lemma we show that any sufficiently long non-decreasing path contains a pumpable pair of paths. 
Lemma 8. Every non-decreasing path longer than £ has a pumpable pair of paths. 

Proof. Let tt be a non-decreasing path longer than £. If ASH(7r) > (|Q| • |r|) 2 , then by Lemma|7]we get the desired 
result; otherwise, it is an easy observation that tt contains a proper cycle, which is by definition a pumpable pair of 
paths (where one path in the pair is empty). □ 

Corollary 2. Every non-decreasing path longer than £ has a pumpable pair of paths with length at most £. 

The next two lemmas show basic properties of PPS. The first lemma asserts that we can decompose every non- 
decreasing path to a set of pumpable pairs and a short non-decreasing path. 

Lemma 9. For every non-decreasing tt path from (71, 91) to (72, 92) there exists a tuple of pumpable pair of paths 
Pi = (p\,p\),P2 = (pl,pl),...,Pj = {p{,P 2 ) 6 PPS((7x, 91), (72,92)) each of length at most £ (i.e., for all 
1 < i < J we have \Pi\ < £), a finite non-decreasing path ttq from (71, 91) to (72, 92) with length at most £, and 
non-negative constants mi, . . . , raj such that w(tt) = w(ttq) + X)i=i m i ' w (Pi) an d M = Ko| + Si=i m i ' l-^l- 

Proof. The proof is by induction of the length of tt. If \tt\ < £, then we are done by choosing j = and ttq = tt. 
Otherwise, by Corollary 12 the path has a pumpable pair P = (pi,P2) with length less than £ (and hence P G 
PPS((9i, 71), (92, 72)))- Let tt* be the path that is obtained from tt by pumping P zero times; clearly tt* is a non- 
decreasing path from (71,91) to (72,92) and shorter than tt, any by the induction hypothesis we get the desired 
result. □ 

The following lemma shows the connection between the average weight of a path and PPS. 

Lemma 10. If PCone((7i, 91), (72, 92)) H K^L = 0, then there exist constants e > and m G N, such that for every 
finite non-decreasing path tt from (71, 91) to (72, 92), there exists a dimension t such that W^tt) < m — e • \tt\. 
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Proof. In order to define e, we consider the following linear programming problem with the variables x\,xi,... and 
r: the objective function is to maximize r subject to the constraints below 



x z -Wi(z)>r fori = l,...,fc (1) 

zePPS(( 7 i,gi), (72,92)) 

zGPPS((7i,<2i), (72:92)) 

x z >0 for all z e PPS(( 7 i,<Zi), (72,92)) (3) 

Intuitively, the first constraint specifies that there is a convex combination of the weights of the pumpable pair to ensure 
at least r in every dimension; and the following two constraints is to ensure that it is a convex combination. As the 
domain of the variables is closed and bounded, there exists a maximum value to the linear program, and let r* be the 
maximum value. If r* > 0, then we get a contradiction to the assumption that PCone((7i, qi), (72, (72)) H Hv? = 0. 
Hence we have r* < 0. We define m — (£+ 1) • W, e — — 7 — and we claim that for every non-decreasing path tt from 
(71, qi) to (72, 92) there is a dimension i such that Wi(Tr) < m — e ■ \tt\. 

Indeed, by Lemma [9] there exists a path ttq with length at most £, a (finite) sequence of pumpable pairs 
P%,...,Pj 6 PPS(( 7 i, qi), (72, 92)) each of length at most £ and constants mi, . . . ,mj such that w(ir) = 
wi^o) + Si=i TO « ' w (Pi) and M — Kol + Si=i m « ' We define M = 53i=i m v As a ^ 1^1 an d \ w o\ are 
bounded by we get that M > ^7-^ . In addition there exists a dimension i for which jj 5Z^ =1 ti^ • wt{Pi) < r*. 
Hence Wt(7r) < u;t(7ro) + A/ • r* and since r* < we have 

Id - £ r* 
w t {ir) < w t (ir ) H r* = w t (ir ) - r* + \n\ ■ —. 

Therefore, for the choice of m > £ ■ W ~ r* and e = -j, we obtain the desired result. □ 

The next proposition gives a sufficient and necessary condition for the existence of a path with non-negative mean- 
payoff values in all the dimensions. 



Proposition 1. There exists an infinite path 7r such that LimlnfAvg(7r) > if and only if there exists a (reachable) 

k 

+ 



non-decreasing cycle tt such that R*^ D PCone(7r) 7^ 



Proof. We first prove the direction from right to left. If there exists such a path tt, then by definition there are j 
pumpable pairs Pi, P 2 , . . . , Pj with weight vectors j/i = w(Pi), ■ ■ ■ ,Uj = w(Pj) such that there exist j positive 
constants (w.l.o.g natural positive constants) n%, . . . , rij such that n i ' ^« — 0. For every a, b e N we denote by 

TT a ' b the (finite) path that is formed by pumping the a-th pumpable pair b times. We denote by TT b = Tr 1,b ' ni ■ 7r 2 ' 6 '™ 2 • 

. . . ^ wnere the j_th pumpable pair is pumped b ■ n.i times, respectively. We note that 7r a ' b is a non-decreasing 

cycle, and for the infinite path 



^* £1 Cfl Cfl 
TT = TT ■ TT ■ TT ■ ■ ■ TT ■ 



we get LimAvg(7r*) > 0. The reason we have LimAvg(7r*) > is as b tends to infinity, the average weight is 
determined only by the weights of the j pumpable pairs and their coefficients n%, . . . ,rij, and we have J2l=i n i'Vi — 0- 
This completes the proof for the direction from right to left. 

For the converse direction, let tt be an infinite path such that LimAvg(7r) > 0, and let (7, q) be a top configuration 
that occur infinitely often in the local minima of tt. Since LimAvg(7r) > it follows that for every e > there exists 
a non-decreasing cycle that begins at (7, q) with average weight at least — e in every dimension. Hence, by LemmafTol 
it follows that PCone( 7 , q) n ^ 0, and hence, there exists a non-decreasing cycle tt that starts in (7, q) for which 
PCone(Tr) n ^ 0. □ 

By Proposition [T] we can decide whether there is an infinite path it for which LimAvg(7r) > by checking 
if there exist a tuple (7,(7) 6 T x Q for which there is a non-negative (and non-trivial) solution for the equation 
PumpMat((7, q)) ■ x > 0. As in Lemma |2]by adding k self-loop transitions with weights, where the weight of 
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transition i is — 1 in the i-th dimension and in the other dimensions, we reduce the problem to finding q and 7 such 
that there is a non-negative solution for PumpMat((7, q)) ■ x = 0. Inspired by the techniques of IPT41 . we present 
an algorithm that solves the problem by a reduction to a corresponding one dimensional problem. As before given 
a fc-dimensional weight function w and a fc-dimensional vector A we denote by w ■ A the single dimensional weight 
function obtained by multiplying the weight vectors with A. The reduction to single dimensional objective requires 
the use of Gordan's lemma. 

Lemma 11 (Gordan's Lemma [20]). For a matrix A, either A ■ x = has a non-trivial non-negative solution for x, 
or there exists a vector y such that y ■ A T is negative in every dimension. 

The next lemma suggests that we can reduce the multi-dimensional problem to a corresponding single dimensional 
problem. 

Lemma 12. Given a WPS A with a k-dimensional weight function w, and (7, q) G T x Q, there exists a non-trivial 
non-negative solution for PumpMat((7. q)) ■ x = if and only if for every Aei 1 there is a non-decreasing path 
from (7, q) to (7, q) that contains a pumpable pair P = (pi , P2) such that (w ■ A) (P) > ( i.e., the weight of the path 
for single dimensional weight function w ■ X is non-negative). 

Proof. The proof is straight forward application of Gordan's Lemma to the matrix PumpMat((7, q)). □ 

Proposition 2. There is a polynomial time algorithm that given WPS A with k-dimensional weight function w, (7, q) G 
r x Q, a vector A G Q fc , and a rational number r G Q decides if there exists a pumpable pair of paths P in a non- 
decreasing cyclic path that begins at (7, q) in A, with — | pf P ^ > T and \P\ < £, and if such pair exists, it returns 

w(P) 

\p\ • 

Intuitively, the algorithm is based on the algorithm for solving WPSs with single dimensional mean-payoff objec- 
tives. We postpone the technically detailed proof to Section lXTl We first show how to use the result of the proposition 
and a result from linear programming to solve the problem. We first state the result for linear programming. 
Linear program with exponential constraints and polynomial-time separating oracle. Consider a linear program 
over n variables and exponentially many constraints in n. Given a polynomial time separating oracle that for every 
point in space returns in polynomial time whether the point is feasible, and if infeasible returns a violated constraint, 
the linear program can be solved in polynomial time using the ellipsoid method ll2Tl . We use the result to show the 
following result. 

Proposition 3. There exists a polynomial time algorithm that decides whether for a given state q and a stack alphabet 
symbol 7 there exists a non-trivial non-negative solution for PumpMat((7, q)) ■ x = 0. 

Proof. Conceptually, given q and 7, we compute a matrix A, such that each row in A corresponds to the average 
weight vector of a row in PumpMat((7, q)) (that is, the weight of a pumpable pair divided by its length), and solves 
the following linear programming problem: For variables r and A = (Ai, . . . , Afe), the objective function is to minimize 
r subject to the constraints below: 

A • A T < f where r = (r, , r) T . (4) 

k 

X] ^ = 1 (5) 

i=l 

Once the minimal r is computed, by Lemma IT2l there exists a solution for PumpMat((7, q)) ■ x = if and only if 
r > 0. 

The number of rows of A in the worst case is exponential (to be precise at most £ ■ (2 • W ■ £) k , since the length of 
the path is at most £, the sum of weights is between — W ■ £ and W ■ £ and there are k dimensions). However, we do 
not enumerate the constraints of the linear programming problem explicitly but use the result of linear programs with 
polynomial time separating oracle. By Proposition |2] we have an algorithm that verifies the feasibility of a solution 
(that is, an assignment for A and r) and if the solution is infeasible it returns a constraint that is not satisfied by the 
solution. Thus the result of Proposition [2] provides the desired polynomial-time separating oracle and we have the 
desired result. □ 
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Hence, we get the following theorem. 

Theorem 4. Given a WPS A with k-dimensional weight function w, we can decide in polynomial time whether there 
exists a path tt such that LimAvg(7r) > 0. 

3.1 Technical details proof of Proposition |2] 

In this section we prove Proposition [2] Throughout this section, we assume WLOG that A is a vector of integers and 
that r = 0. Intuitively the solution is very similar to solving WPS with single dimensional objectives, with some 
technical and tedious modifications. We will present the relevant details. Let A be a WPS with fc-dimensional weight 
function w, and w ■ X be the single dimensional weight function. Let d — (|Q|-|r|) 2 + l. We now recall the notion 
of summary function as defined in [ 12 1. In the definition of summary function below we consider the weight function 
w ■ X. 

Summary function. Let A be a WPS. For a G T* we define s a : Q x V x Q — » {— oo} U Z U {ui} as following. 

L s a ((?i,7, q2) = cj iff for every nel there exists a non-decreasing path from (cry, qi) to (cry, (72) with weight 
at least n. 

2. s a (qi, 7, 02) = z G Z iff the weight of the maximum weight non-decreasing path from configuration (aj, q\) 
to configuration (cry, 92) is z. 

3- s a (qx, 7, 92) = —00 iff there is no non-decreasing path from (cry, qi) to (07, (72)- 

Remark 3. For every a%,a2 G T*: s Ql = s a2 . 

Due to Remarket is enough to consider only s = s±. The computation of the summary function will be achieved 
by considering stack height bounded summary functions defined below. 

Stack height bounded summary function. For every d G N, the stack height bounded summary function : Q x T x 
Q — > { — 00} UZU {ui} is defined as follows: (i) Sd(qi, 7, 52) = w iff for every n G N there exists a non-decreasing 
path from (-L7, qi) to (±7,(72) with weight at least n and additional stack height at most d\ (ii) Sd(qi, 7, 92) = 2 
iff the weight of the maximum weight non-decreasing path from (-L7, q\) to (-L7, 92) with additional stack height at 
most d is z; and (iii) Sd(qi, 7, (72) = —00 iff there is no non-decreasing path with additional stack height at most d 
from (J-7, qx) to (-L7, 52)- Before presenting the key lemma we recall the computation of s J+ i from s,; that will also 
introduce the relevant notions required for the lemma. 

Computation of si + ifrom s, and A. Let Ga be the finite weighted graph that is formed by all the configurations of A 
with stack height either one or two, that is, the vertices are of the form (a, q) where q G Q and a G {1. ■ 7, _L • 71 • 72 | 
7> 71,72 £ r}. The edges (and their weights) are according to the transitions of A: formally, (i) (Skip edges): for 
vertices (_L • a, q) we have an edge to (_L • a, q') iff e = (q, Top(a), skip, q') is an edge in A (and the weight of the 
edge in Ga is w(e)) where a = 7 or a = 71 • 72 for 7, 71, 72 G T; (ii) (Push edges): for vertices (_L • 7, q) we have 
an edge to (_L • 7 • 7', q 1 ) iff e = (q, 7, push(^') : q') is an edge in A (and the weight of the edge in Ga is w(e)) for 
7, 7' G T; and (iii) (Pop edges): for vertices (_L ■ 7 • 7', q) we have an edge to (_L • 7, q') iff e = (q, 7', pop, g') is an 
edge in A (and the weight of the edge in Ga is w(e)) for 7, 7' G T. Intuitively, Ga allows skips, push pop pairs, and 
only one additional push. Note that Ga has at most 2 • \Q\ ■ \T\ 2 vertices, and can be constructed in polynomial time. 

For every i > 1, given the function Si, the graph G\ is constructed from Ga as follows: adding edges 
((-17172,91), (J_7i72, (72)) (if the edge does not exist already) and changing its weight to Si(q%, 72, 92) for every 
7i, 72 £ T and c/i , c/2 G Q. The value of s J+ i(gi, 7, 52) is exactly the weight of the maximum weight path between 
(J-7, qi) and (-L7, (72) in G\ (with the following convention: — 00 < z < uj, z+uj — uj and z+— 00 = u+— 00 = — 00 
for every z G Z). If in G\ there is a path from (-L7, q\) to (-L7, 52) that contains a cycle with positive weight, then 
we set Si+i(qi, 7, (72) = w. Hence, given and ^4, the construction of is achieved in polynomial time, and the 
computation of s J+ i is achieved using the Bellman-Ford algorithm lfl5l in polynomial time (the maximum weight path 
is the shortest weight if we define the edge length as the negative of the edge weight). Also note that the Bellman-Ford 
algorithm reports cycles with positive weight (that is, negative length) which is required to set u> values of Sj+j.. It 
follows that we can compute Si + i given s,; and A in polynomial time. In the computation of the summary function s* 
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we also store along with Si(qi, 7, q 2 ) the weight vector w(P) and the length \P\ of a witness path P that is maximal 
weight (according to w ■ A) shortest non-decreasing path from (7, qi) to (7, q 2 ) with additional stack height at most i. 
We denote by VECT(si(qi, 7, q 2 )) the tuple (w(P), \P\). 

Lemma 13. Let qi,q 2 G Q, 7 G r and d > (|Q||r|) 2 , such that s d (qi, 7, q%) > s<j-i(3l) 7, q^), and let ir be the 
shortest non-decreasing path from (-L7, qi ) to (J-7, q 2 ) with weight Sd+i(qi, 7, q 2 ) and additional stack height d, then 
the following assertions hold: 

1. The path n contains a pumpable pair of paths P = (j>i,p 2 ) with (w ■ A)(P) > with length at most L 

2. We can compute w(P), and ^jjip in polynomial time. 

Proof. The first item was proved in |[T2l . For the second item, we consider the graphs G\ as defined above. Then for 
G\, we compute (based on the summary function s d ) the heaviest non-decreasing path p from (-L7, qi) to (J_7, q 2 ). 
In the path p, we find a sub-path of the form (-L7, z), (-L^S, q 1 ), (-L76, q"), (-L7, z 1 ), for which 

• s d (z,7,z') > s d _i(z,7, z'); and 

• Sd,-i(q',5,q") > s d ^ 2 (q',S,q"); 

(note the by definition, such sub-path must exists.) We store the value of the heaviest weight from (-L7, qi) to (-L7, z), 
and from (±7,2') to (±7,52)- We also store the push and pop transitions and the corresponding vector of the 
weight function w, and repeat the process, recursively, for the heaviest non-decreasing path from (8, q') to (5, q") with 
ASH(<i — 1). We end up with a description of length 0{d) of the form 

P = (-LTi^iJ^ (J-Tljfe) (-L7172, 5iJ ^ (-L71 72,92 ) (^717273, 9i) (-L717273, ^2) 

••• -)■ (±71 . . . 7 d , gf) ~» (±71 . . . 7 d , ql) (±71 . . .7^-1,93 ) ~> (-L71 ■ • ■ 7d-l, <7 4 ) 

P°?2 / 1 d-2\ P d + 3 / 1 d-2\ P°%3 P°Pd 1 , i\ P2 d+i / . !> 

(±71 . . .7d- 2 ,g 3 ) (±7i...7d_2,?4 )->■••• -4 (±71,93) (-L7i,9i); 

where qi = q\, q\ = 52 and 71 = 7. Intuitively, the path p* is decomposed as the path 
pi push l p 2 push 2 ■■■ push d p d+ i pop l p d+2 ■■■ pop d p 2 . d+ i, where the p x realizes the value s d {q\,^i,q\), 
p 2 realizes the value Sd-i{q\ , 72, q 2 ) and so on; and similarly p d +i realizes the value so(qf , 7<j, q 2 ), p d + 2 realizes 
the value si(gg _1 , 7^-1, gf -1 ), p d +3 realizes the value s 2 (q^~ 2 , j d - 2 , q^~ 2 ) and so on; and finally, p 2 . d +i realizes 

sd{ql,n-,q\)- 

Since d > (\Q\ ■ |r|) 2 , there must exist 1 < i < j < d, and hi, h 2 , /13, /14 6 {1, . . . , 4} such that q l hi = q J h , 
1h 3 = 9/i 4 ' 7» = 7j' anc ' me we ight of the path from (±71 . . • 7», q\ x ) to (-L71 • • ■ 7j , <?^ 2 ) plus the weight of the path 
from (-L71 . . . 7j, q l h3 ) to (-L71 • • ■ 7i, g^ 4 ) is positive. We sequentially iterate over all such tuples of hi, h 2 , /13 
and I14 in polynomial time, and a witness path P can be obtained as of the form of p*. The computation of w(P) and 
■^jpy- is obtained from the vector of the summary function, and the push and pop transitions along with the vector of 
weights according to w of such transitions, i.e., 

d 2-d+l 

(w(P),\P\) = (Y J w{push i ) + w{pop i ),2-d)+Y,'< w ^\P i \) 

i=l i=l 
d d+1 2d+l 

= (^w(pwh i ) + w(pop i ),2-<£)+^\mCT(aM^(qi,7i,(&))+ VECT(s i _ d _ 1 (g|,7» 5 

1=1 i=l i=d+2 

Hence it follows that we can compute w(P) and w ^ in polynomial time and the proof follows. □ 

Our goal now is the computation of the u> values of the summary function. To achieve the computation of u 
values we will define another summary function s* and a new WPS A* such that certain cycles in A* will characterize 
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the to values of the summary function. We now define the summary function s* and the pushdown system A*. Let 
d — (\Q\ ■ |r|) 2 . The new summary function s* is defined as follows: if the values of S4 and s^+i are the same then it 
is assigned the value of Sd, and otherwise the value uj. Formally, 



The new WPS A* is constructed from A by adding the following set of w-edges: {(qi, 7, q 2 , skip) \ s*(qi, 7, 92) = 



Lemma 14 ( B12I1 ). For all 91,52 £ Q and 7 £ T, the following assertion holds: the original summary function 
s(qi , 7, 92) = ^ iff there exists a non-decreasing path in A* from (-L7, qi) to (-L7, 92) ffatf goes through an uj-edge. 

We will now present the required polynomial-time algorithm for Proposition |2j and we present the algorithm for 
the case with r = (and this is without loss of generality). The algorithm is similar to solution of WPS with single 
dimensional objective of 1 12|. The final ingredient is the notion of summary graph. 

Summary graph and positive simple cycles. Given a WPS A = (Q, T, qo £ Q, E C (Q x T) x (Q x Com(r)), w ■ A : 
E — > Z) and the summary function s, we construct the summary graph Gr(.4) = (V, E) of A with a weight 
function w : E -> Z U {u} as follows: (i) V = Q x T; and (ii) 23 = E'sfcip U E pus h where E s ki P = 
{((91,7), (92,7)) I s(qi,j,q 2 ) > -oo},andE push = {(( qi , 71), (q 2 , 72)) I (gi, 71)92,^^(72)) £ -E 1 }; and (iii^for 
all e = ((91,7), (92,7)) € ^s«p we have w(e) = s(gi, 7, q 2 ), and for all e £ 2? pU57( we have W(e) = (iu • A(e) 
(i.e., according to weight function of ^l). A simple cycle C in Gr(A) is a positive simple cycle iff one of the following 
conditions hold: (i) either C contains an cj-edge (i.e., edge labeled oj by w); or (ii) the sum of the weights of the 
edges of the cycles according to w is positive. The summary functions and the summary graph can be constructed in 
polynomial time. The first step of the algorithm is to build the summary graph and to check if there is a path from 
(7, q) to (7, q) with a positive weight. We consider the following cases of existence of such a positive weight path. 

1. If there is no such path, then there does not exist pumpable pair of paths P = (pi,p2) with positive weight (i.e., 
there exists no pumpable pair P with (w ■ X)(P) > 0). 

2. We now consider the case when such a positive weight path exists. If such a path exist, we consider the path with 
maximum weight that is shortest (i.e., among the ones with maximum weight we choose a path that is shortest). 
We have two distinct cases. 

(a) We first consider the case when the path do not go through an u> edge. Then the path does not have 
a pumpable pair for the following reason: if the pumpable pair is positive, then the weight is not the 
maximum, and if the pumpable pair is non-negative, removing it ensures we obtain a maximum weight 
path with shorter length. Hence the length of the path is at most I, Since we have stored the vector of the 
summary function (which stores the weights according to w and length of the witness paths) we compute 



(b) Otherwise, the path goes through an uj edge in the summary graph. If there is an u) edge due to a proper 
cycle with positive weight, then we can detect this cycle in the construction of the summary graph and 
compute its average weight according to w (since we have the vector of the summary function that stores 
the weight according to w and the length of the witness paths). Otherwise, by Lemma [T4l it follows that 
there is a non-decreasing path from (7, q) to (7, q) that has a non-decreasing sub-path from (8, qi) to 
(S, 92) and Sd+i(qi,6, q 2 ) > Sd(qi,S, 92)- We have already described a polynomial time algorithm for 
finding such q\, q 2 and 8. Once we find qi, q 2 and 8, by Lemma [T3l we can compute w(P) and "j^ in 
polynomial time. 

The proof of Proposition|2]follows. 




the weight of this path according to w (and not according to w ■ A), and return the average weight of this 
path. 
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4 Recursive Games under Modular Strategies with Mean-payoff Objectives 



In this section we will consider recursive games (which are equivalent to pushdown games) with modular strategies. 
Note that there is no intuitive interpretation of modular strategies for pushdown games and it is standard (as considered 
in all works in literature) to define and consider recursive games in the context of modular strategies. We start with the 
definitions and present four results for mean-payoff objectives in such games: (1) we show undecidability for multi- 
dimensional problem, and hence focus on the single dimensional case; (2) for the single dimensional case we show a 
NP-hardness result; (3) we present an algorithm that runs in polynomial time when relevant parameters are fixed; and 
(4) finally we show a reduction from finite-state parity games to show the hardness of fixed parameter tractability. 

Weighted recursive game graphs (WRGs). A recursive game graph A consists of a tuple (A , A\, . . . , A„) of game 
modules, where each game module Aj = (Ni, Bi, V*, V 2 , Erii, Exi,8i) consists of the following components: 

• A finite nonempty set of nodes N{ . 

• A nonempty set of entry nodes Erii C Ni and a nonempty set of exit nodes Exi C JVj. 

• A set of boxes Bi . 

• Two disjoint sets and V? that partition the set of nodes and boxes into two sets, i.e., V* U V 2 = Ni U -B, 
and Vi n V? = 0. The set (resp. V 2 ) denotes the places where it is the turn of player 1 (resp. player 2) to 
play (i.e., choose transitions). We denote the union of V/ and V 2 by Vi. 

• A labeling Yi : Bi — ► {1, . . . , n} that assigns to every box an index of the game modules A\ . . . A n . 

• Let Calls; = {(b, e) | b G B L ,e G Erij,j = Yi(b)} denote the set of calls of module Aj and let Retns; = 
{(b, x) | b G Bi, x G Exj,j — Yi(b)} denote the set of returns in Ai. Then, Si C (JVj U Retns.;) x (Ni U Calls.;) 
is the transition relation for module Ai. 

A weighted recursive game graph (for short WRG) is a recursive game graph, equipped with a weight function w on 
the transitions. We also refer the readers to |3 j for detailed description and illustration with figures of recursive game 
graphs. WLOG we shall assume that the boxes and nodes of all modules are disjoint. Let B = (J. Bi denote the set 
of all boxes, N = (J. Ni denote the set of all nodes, En = [j i Erii denote the set of all entry nodes, Ex = (L Ex.- L 
denote the set of all exit nodes, V 1 — (J 4 V/ (resp. V 2 = (J i V 2 ) denote the set of all places under player l's control 
(resp. player 2's control), and V = V 1 U V 2 denote the set of all vertices. We will also consider the special case 
of one-player WRGs, where either V 2 is empty (player- 1 WRGs) or V 1 is empty (player-2 WRGs). WLOG we will 
assume that the every module has a unique entrance (a polynomial reduction to module with many entrance to one 
with single entrance was given in Q). The module A$ is the initial module, and its entry node the starting node of the 
game. 

Configurations, paths and local history. A configuration c consists of a sequence (pi, . . . , b r , it), where b\, . . . , b r G 
B and u G N. Intuitively, b\, . . . ,b r denote the current stack (of modules), and n is the current node. A sequence of 
configurations is valid if it does not violate the transition relation. The configuration stack height of c is r. Let us denote 
by C the set of all configurations, and let Ci (resp. C2) denote the set of all configurations under player l's control 
(resp. player 2's control). A path ir = (ci, C2, C3, . . .} is a valid sequence of configurations. Let p = (c\,C2, ■ ■ ■ , c&) 
be a valid finite sequence of configurations, such that Cj = (b\, . . . ,b l d ., Ui), and the stack height of Cj is di. Let ci be 
the first configuration with stack height di = dk, such that for every i < j < k, if Cj has stack height di, then Uj Ex 
(uj is not an exit node). The local history of p, denoted by LocalHistory(p), is the sequence (uj ± , . . . ,Uj m ) such that 
Cji = Ci, Cj m — Ck, ji < J2 < ■ ■ ■ < jm, and the stack height of Cj x , . . . ,Cj m is exactly di. Intuitively, the local 
history is the sequence of nodes in a module. Note that by definition, for every p G C*, there exists i G {1, . . . , n} 
such that all the nodes that occur in Local History (p) belongs to Vi. We say that LocalHistory(p) G A, if all the nodes 
in Local History (p) belongs to Vi. 

Global game graph and isomorphism to pushdown game graphs. The global game graph corresponding to a 
WRG A = (Ai, . . . , A n ) is the graph of all valid configurations, with an edge (ci, C2) between configurations ci and 
C2 if there exists a transition from ci to c^. It follows from the results of [3] that every recursive game graph has an 
isomorphic pushdown game graph that is computable in polynomial time. 
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Figure 1 : Module A 



Plays, strategies and modular strategies. A play is played in the usual sense over the global game graph (which 
is possibly an infinite graph). A (finite) play is a (finite) valid sequence of configurations (c\, c 2 , c 3 , . . .} (i.e., a path 
in the global game graph). A strategy for player 1 is a function r : C* x Ci — > C respecting the edge relationship 
of the global game graph, i.e., for all w G C* and ci G Ci we have that (c\,t{w ■ c\)) is an edge in the global 
game graph. A modular strategy t for player 1 is a set of functions {rj}™ =1 , one for each module, where for every 
i, we have r» : (Ni U RetnSi)* — > £j. The function t is defined as follows: For every play prefix p we have 
t(p) = Ti(LocalHistory(p)), where Local History (p) G A%. The function r, is the local strategy of module Aj. 
Intuitively, a modular strategy only depends on the local history, and not on the context of invocation of the module. 
A modular strategy r = {Ti}" =1 is a finite-memory modular strategy if Tj is a finite-memory strategy for every 
i G {1, . . . , n). A memoryless modular strategy is defined in similar way, where every component local strategy is 
memoryless. 

Mean-payoff objectives and winning modular strategies. The modular winning strategy problem asks if player 1 
has a modular strategy r such that against every strategy a for player 2 the play 7r given the starting node and the 
strategies satisfy LimAvg(-7r) > (note that the counter strategy of player 2 is a general strategy). 

4.1 Undecidability for multi-dimensional mean-payoff objectives 

In this section we will show that the problem of deciding the existence of modular winning strategy for player 1 
in WRGs with multi-dimensional mean-payoff objectives is undecidable. The reduction would be from reachability 
games over tuples of integers. We start by introducing these games. 

Reachability games over Z fe . A reachability game over Z fe consists of a finite-state game graph G, a k dimensional 
weight function w : E — > Z fe , and an initial weight vector v G Z fc . An infinite play it is winning for player 1 if there 
exists some finite prefix n' C ir such that w(n') + V = and the last vertex in n' is a player- 1 vertex. 

Lemma 15. The following problem is undecidable: Given a reachability game over 1? and a starting vertex v, decide 
if there is a winning strategy t for player 1 to ensure that for all strategies a for player 2 the play 7t(t, a, v) is winning 
for player 1. 
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(0,0,0,0,+ l,-l) 
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Figure 2: Module A\ 




Figure 3: Module A 2 



Proof. We make a simple observation that the undecidability proof for reachability games over N 2 (e.g., see fl~|) is 
easily extended to games over Z 2 . □ 

We will present a general reduction from reachability games over 1 k to WRGs under modular strategies with 
multi-dimensional mean-payoff objectives of 2 • k + 2 dimensions, with three modules (two of them with single exit, 
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and an initial module without any exits). Given a reachability game over 7L k with game graph G, weight function w 
and initial vector z7, we construct a WRG graph A — (Aq, Ai, A2) with a weight function of 2 • k + 2 dimensions in 
the following way. 

• Module Aq: This module repeatedly invokes Ai and A2 (one call to A\ and one call to A2); and all the weights 
of the transitions are 0. 

• Module A\\ This module has three nodes: entrance, exit and an additional one with a self-loop edge with weight 
in the first 2 • k dimensions, weight +1 in dimension 2 • k + 1 and weight — 1 in dimension 2 ■ k + 2; the weight 
of the edges from the entrance node to the additional node and from the additional node to the exit node are in 
every dimension. All the nodes are in the control of player 1 . 

• Module A2: The nodes of this module are the entrance and exit nodes, the nodes V of the reachability game G, 
and an additional node v*. The entrance node leads to the initial vertex of G with edge weight (i?, — v, 0, 0) (i.e., 
the first k dimensions are according to v, dimensions k + 1 to 2 • k are according to — V, and the last two dimen- 
sions are 0). For every edge e = (u, v) in G, there is such transition in A2 with weight (w(e), — w(e), — 1, +1). 
In addition, from every player- 1 vertex in V there is a transition to v* with weight in every dimension. In v* 
there is a self-loop transition with weight —1 in dimension 2k + 1, +1 in dimension 2k + 2 and in the rest of 
the dimensions; and in addition there is a transition to the exit node with weight in every dimension. 

The pictorial descriptions of the modules Aq, Ai, and A2 are shown in FigureQ] Figure|2j and Figure[3] respectively. 

Observation 1. The following observations hold: 

1. If player-1 strategy for module A\ is to never exit, then it is not a winning strategy (since the mean-payoff in 
dimension 2 ■ k + 2 will be — I.) 

2. If for a player-1 strategy T2for module A2, there is a play tt consistent with T2 that does not reach v* , then T2 is 
not a winning strategy {since the mean-payoff of p in dimension 2 ■ k + 1 will be —1.) 

Lemma 16. If player 1 does not have a winning strategy in the reachability game over "L k , then there is no modular 
winning strategy for player 1 in A. 

Proof. If player 1 does not have a winning strategy in the reachability game over Z fc , then let a be a player-2 winning 
strategy for the reachability game. We fix player-2 strategy for the modular game to be a according to the local history 
of A2 and claim that it is a winning strategy for player 2 in the WRG against the multi-dimensional mean-payoff 
objective for player 1. Indeed, let r = {ti, T2} be a player-1 modular strategy, and we consider the path 7r which is 
formed by playing according to r and a. By Observation Q] if 7r never exit A\ or never reach node v*, then player 2 
wins. Otherwise, since a is a winning strategy in the reachability game, we get that in the first subpath of 7r that leads 
from the entrance of A2 to v*, one of the dimensions 1 < i < 2 ■ k has a negative weight. We note that both a and r are 
modular strategies, and thus the path it is periodic and the mean-payoff of n in dimension i is negative. To conclude, 
if player 2 is the winner in the reachability game, then player 1 does not have a modular winning strategy in A. □ 

Lemma 17. If player 1 has a winning strategy in the reachability game, then there is a modular winning strategy for 
player 1 in A 

Proof. Let tq be a player-1 winning strategy for the reachability game. By Konig's Lemma there exists a fixed 
constant neN such that player 1 can assure the reachability objective, against every player-2 strategy, with at most n 
rounds. We now derive a modular winning strategy in A from tg- 

• Module At: Follow the self-loop edge for n rounds and exit. 

• Module A 2 : Follow strategy tq, until the weight in every dimension, according to the reachability game over 
G, is and a player-1 vertex was reached, and then go to v*. Let m be the number of rounds played according 
to tq in the current local history of A2, then player 1 follows the self-loop in v* for n — m times and goes to 
the exit node. 
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Figure 4: Module Ai for (x\ V -1X2 V £3) A (X2 V -1X3 V ^£4) 

-1 



It is easy to observe that any play according to the strategy above has a mean-payoff value of in every dimension. □ 

From Lemmas [T6l [P71 and [T31 we obtain the following result: 

Theorem 5. The problem of deciding the existence of a modular winning strategy in WRGs with multi-dimensional 
mean-payoff objectives is undecidable, even for hierarchical games (i.e., games without recursive calls), with six 
dimensions, three modules and with at most single exit for each module. 

In view of Theorem[5]we will focus on complexity and algorithms for WRGs under modular strategies for single- 
dimensional mean-payoff objectives. 

4.2 NP-hardness 

We consider WRGs under modular strategies with single dimensional mean-payoff objectives. It was already shown 
in lfl2 j that if the number of modules is not bounded, then even if all modules have at most one exit, the problem 
is NP-hard even when there is only player 1 and weights are restricted to { — 1,0,1}. We present a similar hardness 
result when the number of modules are restricted to only two, but the number of exits are not bounded. We present a 
simple log-space reduction from 3SAT to WRGs with two modules. The objective we will consider is the reachability 
objective, where the mean-payoff objective is satisfied once a vertex r is reached (i.e., r has a self-loop with weight 
and all other transitions have negative weight). 

The reduction. For a 3SAT formula ip(xi , . . . , x n ) = AHi C» we construct a WRG with two modules, namely A 
and A\. 

• Module A\. The module has 2n exits namely, Ex xi , Ex~, Xl , . . . , Ex Xn , Ex^ Xn , an entrance node that is owned 
by player 2, and n player-1 nodes x\, . . . , x n . From the entrance node there is a transition (En, Xi), for i = 
1, . . . ,n; and from every node Xi there is one transition to Ex Xi and one transition to Ex^ Xi . Intuitively, a 
modular strategy for player 1 is to decide on a True/False value for every X{. 

• Module A^: This is the initial module; it consists of m gadgets Ci, . . . , C m (note that these are gadgets - 
not modules), and two sink states, namely r and -ir, where r is the reachability objective. A gadget Ci = 
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Figure 5: Module Aq 



Hi V j/f V yf consists of three sub-gadgets, namely, y},y?,yf; gadget y\ invokes module A\ and the exits 
({Ex Xl , Ex^ Xl , . . . , Ex Xn , Ex^ Xn } \ {yf , ^yf}) of A\ leads to the good sink node r, the exit y\ leads to gadget 
Ci+i (or to node r if i = m), and the exit leads to sub-gadget yj +1 (or to the bad sink node -r if j = 3). 

The reduction is illustrated in Figure [4] and Figure [5] It is an easy observation that player 1 has a modular winning 
strategy iff the formula ip is satisfiable. 

Theorem 6. The decision problem of existence of modular winning strategies in WRG's with single dimensional 
mean-payoff objectives is NP-hard even for WRG's with two modules and weights restricted to {0, —1}. 

4.3 Algorithm for single dimensional mean-payoff objectives 

Given the undecidability result, we focus on WRGs with single dimensional mean-payoff objectives, and given the 
hardness results for either unbounded number of modules or unbounded number of exits, our goal is to present an 
algorithm that runs in polynomial time if both the number of modules and the number of exits are bounded. For 
the rest of this section we denote the number of game modules by M, the number of exits and boxes (in the entire 
graph) by E and B, respectively, and by n and m the maximal size of | Vi | and | Si \ (number of vertices and transitions) 
respectively that a module has. If M, E and W (the maximal absolute weight) are bounded, then our algorithm runs in 
polynomial time. We first present a theorem from [12| that will be useful in our result and then present the notion of 
cycle-free memoryless modular strategy. 

Theorem 7 ( H12I ). Given a WRG A with a single dimensional weight function, if there is a modular winning strategy 
for the objective LimAvg, then there is a memoryless modular winning strategy. 

Negative-cycle-free memoryless modular strategy. A player- 1 memoryless modular strategy r is called negative- 
cycle-free memoryless modular strategy if in the recursive graph A T there are no proper cycles C with negative 
weights, i.e., w(C) < 0. 
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Signature of a negative-cycle-free memoryless modular strategy. The signature of a negative-cycle-free memory- 
less modular strategy t = {Tj}^ is an M-tuple of function Sig(r) = {Sig^ : Ex{ ->ZU {— to, +oo}} J j / L 1 such that 
for an exit node x in module A.- L we have Sig^x) = z if 

• z € Z and the non-decreasing path with the minimal weight in A T from Erii to x (in the same stack height) has 
weight z. 

• z — +00 and there is no non-decreasing path in A T from Erii to x. 

• z = —u) and for every integer j there is a non-decreasing path in A T from Erii to x (at the same stack height), 
with weight at most j. 

The next lemma demonstrates an important property of signature functions. 

Lemma 18. Let I = (M • n) +1 ; let r be a negative-cycle-free memoryless modular strategy; and let W denotes 
the maximal weight (in absolute value) that occur in A. Then Sig(r) has the following property: 

For every i 6 {1, . . . , M}, the image (range) o/Sig^ is {— w, +00} U (Z Pi [— W ■ £,W ■ £]) 

Proof. We fix the strategy r in A, and obtain the player-2 recursive game graph A T . To show the result we need to 
prove that if there is a path (in A T ) from Erii (the entrance of Ai) to x e Ex.i with weight less than — W ■ £, then for 
every r G Z there exists a path from Erii to x, consistent with r, and with weight less than r; and that it is impossible 
that the path with the minimal weight from Erii to x has weight at least W ■ £ + 1. 

The proof is as follows: let tt be the shortest path in A T from Erii to x with weight w(tt) < — W ■ £ (note that 
tt corresponds to a play consistent with r). Since 10(71") < — W ■ £, it must be that \w\ > £, therefore tt must have a 
pumpable pair of paths, and since tt is the shortest path from Erii to x with such weight, the weight of the pumpable 
pair must be strictly negative, and thus we can construct paths from Erii to x with arbitrary small weights. 

Similarly, we show that if there is a path from Erii to x, then there is a path with weight at most W ■ £ — 1. Towards 
contradiction, let tt be the path with minimal weight between Erii and x and w(ir) >W-£ and tt is the shortest path 
with minimal weight. As |7r| > £ it follows that it has a pumpable pair of paths P. If w(P) > or w(P) < 0, then we 
get a contradiction to the fact that tt has minimal weight (either by omitting P if w(P) < or pumping P arbitrarily if 
w(P) > 0). If W{P) = 0, then we get a contradiction to the assumption that tt is the shortest path by simply omitting 
P. The desired result follows. □ 

Feasibility of signature. We say that a function Sig : Ex — > {—u>, +00} U Z is feasible if there is a negative-cycle- 
free memoryless modular strategy r such that Sig(r) = Sig. 

Lemma 19. Given a threshold vector v 6 ({— uj, +00} U Z) E , we can verify in (M ■ n)°( M ' E2 ' • time if there 

exists a feasible signature function Sig : Ex — > {— u>, +00} U Z such that Sig > v (i.e., for every x S Ex we have 
Sig(ar) > v x ). 

Proof. By Lemma [T8l we may assume that the input is restricted for V 6 ({— uj, +00} U (Z n [~W£, +W£])) E and 
Sig : i?:E — > {— w, +00} U (Z fl [— W£, +W£]). The proof of the lemma will use the idea of signature verification 
games. 

The signature verification games. For a recursive game A and a function Sig : Ex — > {—oj, +00} U Z we construct M 
game modules G\, . . . , Gm, such that Gi is formed from the module Ai by replacing every box b, that invokes module 
Aj and its fc-th return node leads to node Vk, with a player-2 node v\, and edges (vi,,Vk) with weight Sig^ (Exk). 
Intuitively every game module is like a finite-state game with thresholds for exit vertices. We first prove a claim 
related to signature verification games. 

Claim. For every game module G,; there exists a strategy r,; that satisfies Sig, i.e., it assures: 

• every path in Gp from Erii to Exj has weight at least S\gi(Exj); and 

• there are no cycles with negative weight in Gp ; 
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if and only if there exists a feasible signature function Sig' such that Sig' > Sig. 

Proof of claim. We prove both the directions of the claim. We start with the left to the right direction. By Theorem|7] 
such strategies {t;}^ exist iff there exist memoryless strategies {t*}^ that satisfies the above. Clearly, t* is also 
a modular strategy. In addition, for every path it in A T , the path does not contain negative proper cycles (and hence, 
t* is a negative-cycle-free strategy), and the path does not violates the constraints according to Sig. The proof is by a 
straight forward induction on the additional stack height of w. Hence we have Sig(r*) > Sig. The other direction is 
simpler. Clearly if there exists a negative-cycle-free modular strategy r such that Sig(r) > Sig, then T{ satisfies both 
items for every game module Gi. This proves the desired claim. 

The results of lfl3l Lemma 31] provides an algorithm that decides if for a given function / : Ex — » {— u, +00} U 
(Z n [— W£, +W£]) and a game module Gi there is a memoryless strategy that satisfies /; this is done by solving a 
(finite-state) mean-payoff game with single-dimensional objective with weights at most 2 • n ■ W ■ £. Hence, we can 
sequentially go over all the functions / : Ex —> {—u>, +00} U (Z n [— W£, +W£]) such that / > v and check if / is 
satisfiable. By the claim a signature Sig > Sexists if and only if such / was found. 

Complexity. The complexity analysis is as follows: there are (2 • W ■ £ + 2) E functions to verify; in the verification 
process we solve M mean-payoff games with weights at most 2 • n ■ W ■ £ and at most n vertices and m edges; and 
every mean-payoff game can be solved in 0(m ■ n 2 ■ W ■ £) time Q5). Thus the the overall complexity is 

0(n 2 • m • (W ■ £) E+1 ■ M) = o^m-e^+m-e+e+s . m . m m.e^ + m.e + e +2 . ^E+ij = (M . u) o(m.e=) . w o(e) 
The desired result follows. □ 

Reduction from modular games to signature problem. Intuitively, for a given WRG A, we would like to construct 
a new WRG A', such that player 1 is the winner in A iff there exists a feasible signature in A' with certain proper- 
ties. We construct A' in the following way: Let (A\, . . . , Am) be the modules of A, then we construct the modules 
(A[, . . . , A' M ) from (Ax, . . . , Am) as follows: 

• Add M exit nodes X\ , . . . , xm for every module. 

• For every box node b, in module Aj, if b invokes module A;, then for all k ^ i, the exit Xi is connected (by an 
edge with weight 0) to the exit Xi in the module Aj, and if k = i, then the exit leads to a sink state (and the 
weight of the self-loop is positive). 

• W.l.o.g we assume that all the entrances are player-2 nodes, and we add edges with zero weight from each 
entrance to all the new exits x±, . . . , xm ■ 

We note that the number of exits E' in A' is E + M 2 . The following lemma establishes winning in A and properties 
of signature function in A'. 

Lemma 20. Player 1 has a memoryless modular winning strategy in A iff there is a feasible signature Sig in A 1 such 
that for every module A[ we have Sig^x,;) > 0. 

Proof. We first prove the direction from left to right. Let r be a memoryless modular winning strategy (and therefore 
also negative-cycle-free) in A. We note that r is a modular negative-cycle free strategy also for A'. We claim that (the 
feasible signature function) Sig = Sig(r) satisfies Sig^Xj) > 0. Indeed, if Sig^Xj) < 0, then by the construction of 
A', there is a play p from Erii to Erii (at an higher stack height) with negative weight, that is consistent with r. Since 
r is a modular strategy we get that p u is a play with a negative mean-payoff that is consistent with r, which contradicts 
the assumption that r is a winning strategy. 

To prove the converse direction, let r be a memoryless negative-cycle-free strategy in A' such that Sig(r) = Sig. 
We note that r is a modular strategy also for A and we claim that it is a winning strategy for A. Indeed, let A T be the 
player-2 game according to r; if in A T there is a path with negative mean-payoff then either 

• there is a proper cycle in A T with negative weight, which contradicts the assumption that r is negative-cycle-free 
strategy; or 
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• there is a non-decreasing cycle A T with negative weight. If this is the case then for some module Ai there is a 
non-decreasing path in A T from Erii to Erii with negative weight, and thus in A' T there is a path with negative 
weight from Erii to ir, and therefore Sig i (^) < 0, in contradiction to the assumption. 

The desired result follows. □ 

Theorem 8. Given a WRG A with a single dimensional mean-payoff objective, whether player 1 has a modular 
winning strategy can be decided in (n ■ M)°( M + M E ) . W°( M +E ) time. 

Proof. We first construct the modular game graph A' and then we check if there is a signature function Sig such that 
Sig^a;^) > for every i £ {1, . . . , M}. The correctness and complexity follows from Lemma |20l and Lemma[T9l □ 

4.4 Hardness for fixed parameter tractability 

Given Theorem[8] (algorithm to solve in polynomial time when M and E are fixed) an interesting question is whether 
it is possible to show that WRGs under modular strategies is fixed parameter tractable (i.e., to obtain an algorithm 
that runs in time 0(/(M, E) • poly(n, m, W))). We show the hardness of fixed parameter tractability, again by a 
reduction from parity games, implying that fixed parameter tractability would imply the solution of the long-standing 
open problem of fixed parameter tractability of parity games. 

Parity games to mean-payoff games with large weights. In 11241 a reduction of finite-state parity games 
to finite-state mean-payoff games was presented, and the weights for the mean-payoff game used were 
{(— n)°, (—n) 1 , . . . , (— n) 1 , . . . , (— n) k }, where k is the number of priorities of the parity function. The reduction 
was a 0(k ■ n ■ log n) time reduction. 

The reduction. Given a finite state mean-payoff game G with n vertices and weights 

(— n)°, (— n) 1 , . . . , (— n) 1 , . . . , (—n) k we construct a recursive game graph A = (Aq, Pi, . . . , Pp., Ni, . . . , Nk) 
with 2 ■ k + 1 modules in the following way. 

• The Pi modules: all the nodes in the Pi modules have out-degree 1 (so the owner is irrelevant), and all the 
modules have only one exit. In module Pi the out-edge of the entrance node leads to the exit node and has 
weight +n (equivalently, it has a path with length n to the exit node, and the weight of each edge in the path is 
+1). For i > 1, the module Pi invokes n times the module Pj_i and goes to the exit node. 

• The Ni modules: all the nodes in the Ni modules have out-degree 1 (so the owner is irrelevant), and all the 
modules have only one exit. In module Ni the out-edge of the entrance node leads to the exit node and has 
weight — n (equivalently, it has a path with length n to the exit node, and the weight of each edge in the path is 
— 1). For i > 1, the module N invokes n times the module A^_i and goes to the exit node. 

• The Ao module: Aq is formed from the vertices of the finite state game graph G, and every transition (u, v) in 
G, with weight (— n) % is replaced by a transition from u to a box b and by a transition from the return node of b 
to v (both with weight 0), where b invokes Pi if i is even, and invokes Ni if i is odd. 

Remark 4. The path from the entrance of module Pi (resp. Ni) to its exit has weight n l (resp. — (n 1 )). 

Proof. The proof is by a trivial induction on i. □ 

We observe that all strategies in A are modular strategies, and that a modular winning strategy in A is a winning 
strategy in G, and vice versa. We have the following result. 

Theorem 9. Given a finite-state parity game G with n vertices and priority function of k-priorities, we can construct 
in polynomial time a WRG A with 2 • k + 1 modules, with 0(k ■ n) nodes and weights restricted to {—1, 0, +1} such 
that a vertex v is winning for player 1 in the parity game iff there is a modular winning strategy in A with v as the 
initial node. 
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